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ABSTRACT 

It is possible that ultra-high energy cosmic rays (UHECRs) are generated by active 
galactic nuclei (AGNs), but there is currently no conclusive evidence for this hypoth- 
esis. Several reports of correlations between the arrival directions of UHECRs and 
the positions of nearby AGNs have been made, the strongest detection coming from 
a sample of 27 UHECRs detected by the Pierre Auger Observatory (PAO). However, 
the PAO results were based on a statistical methodology that not only ignored some 
relevant information (most obviously the UHECR arrival energies but also some of the 
information in the arrival directions) but also involved some problematic fine-tuning 
of the correlation parameters. Here we present a fully Bayesian analysis of the PAO 
data (collected before 2007 September), which makes use of more of the available in- 
formation, and find that a fraction -Fagn = ^-lbt^'l^ of the UHECRs originate from 
known AGNs in the Veron-Cetty & Veron (VCV) catalogue. The hypothesis that all 
the UHECRs come from VCV AGNs is ruled out, although there remains a small 
possibility that the PAO-AGN correlation is coincidental (-Fagn = 0.15 is 200 times 
as probable as -Fagn = 0.00). 
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1 INTRODUCTION 

Cosmic rays (CRs) are highly accelerated protons and nu- 
clei that reach Earth with arrival en ergies in the wide range 
10 s eV < £arr < 10 20 eV (see, e.g. IStokerl [20091 ). However 
' the origin of ultra-high energy cosmic rays (UHECRs) with 
E alI > 10 19 eV, in particular, remains uncertain. The most 
promising theory is that UHECRs are generated by active 
galactic nuclei (AGNs), and there are several physica l mod- 
els to motivate this idea fe.g. lProtheroe &: Szabolil992l : lDiehll 

l2009l h but this hypothesis requires empirical verification. 

A number of difficulties hinder efforts to gain exper- 
imental evidence about UHECRs. The most fundamental 
problem is that CRs are deflected by the Galaxy's mag- 
netic field: the arrival directions of lower energy protons are 
essentially independent of their point of origin, although 
UHECRs are expe cted to be deflect e d by no more than 
a few degrees (e.g. iDolag et alj|2005l : | Medina Tanco et al.l 

Il99ct ). It is also problematic that UHECRs are very rare, 
with the observed number flux falling off with energy 
as dr oba /dg arr ~ fg arr /(10 19 eV)]" 2 ' 6 s" 1 m" 2 sr" 1 (e.g. 

I Abraham et alj|2010h . The fall-off is expected to be even 



E-mail: l.watson09@impcrial.ac.uk 



more extreme above -Egzk — 5 x 10 19 eV, as protons 
at these energies interact with cosmic mi crowave back- 
ground (CMB) photons to produce pions (|Greisenl Il966l ; 
IZatsepin fc Kuz'min|[l96rj . hereafter GZK). The GZK mean 
free path between interactions for an E ~ 10 20 eV pro- 
ton is only about 4 Mpc, and each interaction typically 
reduces a CR's energy by approximately 20 per cent (e.g. 
R achen &BiermamJh993). so any observed UHECRs must 
have originated within an effective 'GZK horizon' of about 
100 Mpc. (If, alternatively, UHECRs are primarily Fe nuclei 
the GZK horizon is expected to be even smaller, although 
the deflection due to magnetic fields is greatly increased.) 
The GZK effect reduces the number of detectable UHE- 
CRs, but a fortunate consequence is that it also reduces 
the number of plausible AGN sources to the few thousand 
with distances of D < 100 Mpc or, equivalently, redshifts 
of z < 0.03. This makes it plausible to search for a correla- 
tion between the arrival directions of UHECRs and locations 
of local AGNs, provided sufficiently many UHECRs can be 
observed. 

The problem of the low UHECR arrival rate can only 
be overcome by using a large collecting area, and by ob- 
serving for long periods of time. At present, the largest 
CR observatory is the Pierre Auger Observatory (PAO; 
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I Abraham et al.|[2004l ). which has been operational since 2004 
January. During its first 3.6 years of observing, the PAO 
made reliable detections of the arrival directions and en- 
ergies of 81 UHECRs, of which 27 had an (esimated) ar - 
rival energy of E obs > 5.7 x 10 19 eV (| Abraham et al.ll2007tj ). 
These 27 events were found to be strongly correlated with a 
sample of local AGNs in the IVeron-Cettv fc Veronl |2006l . 
hereafter VCV) catalogue; this was the first strong em- 
pirical support for the hypothesis that UHECRs are gen- 
erated by AGNs. The PAO has continued to operate in 
the time since thes e results were obtained; the latest data 
(|Abreu et alJ l2010h show a much weaker correlation. As 
discussed in iBeattv fc Westerhofll \2Q0$ ), there have been 
many attempts to find a correlation between AGNs and 
UHECRs, using a variety of techn i ques a n d data, such as 
those reporte d by Nemmen et al.l (|2010h. I Abraham et al.l 
d200SL l2007bh. lAbbasi et alJ (|2008l ). bhisellini et alJ J200gh 
and lGeorge et all (|2008f ). In particular. I Abbasi et all (|2008l ) 
claim no significant correlation. 

Given the small numbers of UHECRs on which the PAO 
results are based, some care must be taken with statisti- 
cal methods, both to ensure that all the available informa- 
tion is utilised and to avoid over-interpretation. These aims 
can be achieved by adopting a Bayesian approach in which 
the relevant stochastic processes (e.g. the GZK interactions 
of the UHECRs with the CMB, deflection by the Galaxy's 
magnetic field, measurement errors) are explicitly modelled. 
Even though the details of some of these processes are not 
well known (most relevantly the strength of the magnetic 
fields and the energy calibration of the UHECRs) , such un- 
certainties can be accounted for by marginalisation. Whe reas 
the simple correlation analysis of lAbraham et ail |2007bl ) ig- 
nores the arrival energy of the individual UHECRs, implic- 
itly assuming that the GZK horizon is independent of E alI , a 
likelihood-based approach can incorporate the fact that the 
very highest energy events are expected to have come from 
the most nearby AGNs. Similarly, the use of circular angular 
regions to match UHECR arrival directions and AGN posi- 
tions is both sub-optimal (as real matches would tend to 
be more centrally concentrated) and potentially misleading 
(because any resultant statistic is as sensitive to physically 
implausible correlations as to the tighter angular matches 
that would be expected if the AGNs were the UHECRs' 
progenitors) . 

While neither ignoring the individual UHECRs' arrival 
energies nor using a hard effective GZK horizon are neces- 
sarily inconsistent, both choices decrease the constraining 
power of the data-set. For instance, a strong prediction of 
the AGN hypothesis is not only that UHECRs' arrival di- 
rections will be correlated with nearby AGNs, but that ev- 
ery AGN-sourced UHECR will be directly associated with 
at least one candidate source, and that the most energetic 
events will come from the closer AGNs. Critically, it is pos- 
sible to use a physical model of UHECR generation, propa- 
gation and observation, hence extracting much more of the 
valuable information in a UHECR data-set than is possible 
with other, more heuristic, analysis methods. 

In this paper we take the first steps to developing a com- 
prehensive Bayesian formalism for analysing UHECR data. 
Our starting po int is to reanalyse the UHECR and AGN 
samples used by I Abraham et aT, (|2007bT ) . changing only the 
statistical method. Aside from providing an answer to the 



question of whether the 27 PAO UHECRs come from the lo- 
cal VCV AGNs, we will show directly how the results depend 
on the statistical method used to analyse such data-sets. Af- 
ter describing the UHECR and AGN samples in Section [2] 
our statistical method and CR propagation model are pre- 
sented in Section [3] The results of applying this methodol- 
ogy are given in Section [4] and the overall conclusions are 
summarised in Section [5] 



2 DATA 

The sample of UHECRs (Section^T) and the putative AGN 
sources (Section 12.21) an alysed here are the same as used by 
lAbraham et all (|2007bl) . 



2.1 PAO observations of UHECRs 

The PAO South is located near Malargiie in Argentina, at 
a longitude of 69? 4 and a latitude of —35? 2. It has 1600 
surface detectors (SDs) that cover an area of 3000 km 2 , as 
well as four arrays of six atmospheric fluorescence telescopes. 
The PAO recorded N c = 27 UHECRs with reliable detected 
energies of E ohs > E m - m = 5. 7 x 10 19 eV between 20 04 Jan- 
uary 1 and 2007 August 31 (| Abraham et alj|2007bl ). These 
27 events are shown in Fig. [T] 

The arrival direction s of UHECRs are mea sured with an 
accuracy of about 1 deg (|Abraham et al.ll2008l ) by the PAO, 
although there is an additional effective uncertainty in the 
progenitor directions as the UHECRs are deflected by Galac- 
tic and inter-galactic magnetic fields. The magnitude of this 
effect is somewhat uncertain, with estimates of the typical 



deflection angles ranging f rom 2 deg (e.g. iDoiag et al ll2005l ; 



Jig: 

iMedina Tanco et all 119981 ) to 10 deg (e.g. ISigl et alJ \2004 ) 



for E m[n ~ 10 20 eV UHECRs. Despite lack of knowledge 
about the magnetic field strengths, the combined effect of 
both the deflection and the errors in the directional recon- 
struction is to ensure that the observed arrival direction, 
f bs, and the unit vector to the progenitor, f src , are sepa- 
rated by, typically, a smearing angle of a few degrees. We 
model this process by defining the conditional probability 
distribution of observed arrival directions of UHECRs from 
a source at f src as a two-dimensional Gaussian on the sphere, 



Pr(?*obs |^src) 



2tto- 2 (1 



-2/ct 2 



■ exp 



1 - fobs 



(1) 



We assume a fiducial smearing angle of a — 3 deg unless 
otherwise stated, but also calculate results using a = 6 deg 
and a = 10 deg for comparison purposes. 

Over the 3.6 years that the 27 UHECRs were detected, 
the effective area of the PAO increased steadily, but the 
evolution was sufficiently gradual that the exposure per 
unit solid angle, de/dfi (which has units of area x time) 
is a function of declination only. The angular dependence 
of the PAO exposure can be approximated by assuming 
that the instantaneous exposure is constant within 60 deg 
of the zenith and zero otherwise. (The detailed angular de- 
pendence is dominated by the cross-sectional area of the 
SD array, and there are smaller corrections due to the vari- 
ous PAO data cuts, but these secondary effects are ignored 
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Figure 1. The arrival directions of the N c = 27 PAO UHECRs (black points) and the source- weighted exposure (greyscale: darker 
indicates greater exposure) for the background-only model (left) and the AGN-only model (right), in Galactic coordinates. The Galactic 
Centre (GC), South Celestial Pole (SCP) and PAO's field of view (FoV) are all indicated. Lines of constant Galactic latitude |6| = 10 deg 
are also shown. 



here.) Integrating the instantaneous e xposure over t i me to 
account for the Earth's rotation (cf. iFodor fc Katzl 1200 ll ) 
yields the declination-dependent exposure e(f) shown in the 
left panel of Fig.[T] The total exposure, e to t = / (de/dQ,) dfi, 
for the PAO observatio ns considered here is 9000 yr km 2 sr 
^Abraham et al.ll2007al ). 

2.2 Local AGNs 

We follow I Abraham et all J2007bl) in considering only A GNs 
in the 12th edition of the IVeron-Cettv fc Veron] |2006l ) cat- 
alogue as possible sources for the PAO UHECRs. The dis- 
tance to each source, D, is calculated from the quoted ab- 
solute and apparent magnitudes in the VCV catalogue, and 
AGNs without absolute magnitudes are omitted. The full 
VCV catalogue contains 108 014 AGNs, but only N s = 921 
have z b s ^ 0.03 and are hence plausible UHECR progeni- 
tors inside the GZK horizon of about 100 Mpc. 

The VCV catalogue is heterogeneous, having been com- 
piled from a variety of AGN and quasar surveys and, as 
such, it is not ideal for statistical studies. It is, however, ex- 
pected to be close to complete for the local AGNs of interest 
here, except close to the Galactic plane. Moreover, as empha- 
sised in Section [T] the VCV sample was ch osen specifically 
to facil litate comparison with the results of lAbraham et all 
l|2007bl ). 



3 STATISTICAL METHOD 

Do the observed arrival directions of the 27 PAO UHECRs 
provide evidence that at least some of them were emitted by 
known nearby AGNs? We answer this question by using a 
two-component parametric model characterised by the rate 
at which UHECRs are emitted by each AGN, r src , and the 
rate at which an isotropic background of UHECRs arrive 
at Earth, If none of the UHECRs come from the 

1 The two rates have different units: r src is the average number 
of UHECRs emitted per unit time by an AGN, and is given in 
units of s" 1 ; -Rbkg is the average number of background UHECRs 
arriving at Earth per unit time, per unit area, per unit solid angle 
and is given in units of s~ 1 m -2 sr -1 . 



candidate AGNs then the data should be consistent with 
Psrc = 0; conversely, if all the UHECRs come from the AGNs 
in the catalogue, then the data should be consistent with 
-Rbkg = 0. 

The full constraints on r src and -Rbkg implied by the 
PAO data are summarised in their joint posterior probability 
distribution, given by 

Pr(r src ,Rbk g |data) (2) 

= Pr(r Br c, -Rbkg) Pr(data|r src , R bkg ) 

J^L I^L> P r (r sr c, Rbkg) Pr(data|P src , Rbkg) dr src dR b k g ' 

where Pr(r src , Rbkg) is the prior distribution that encodes 
any external constraints on the rates, Pr(data|P src , Rbkg) is 
the likelihood of obtaining the measured data given particu- 
lar values for P src and Rbkg, and the integral in the denom- 
inator is the evidence. As we are not making any compar- 
isons to other models, the only role the evidence plays here 
is to ensure the posterior is correctly normalised; hence it 
can be ignored when investigating the shape of the poste- 
rior. We adopt a uniform prior over Rbkg ^ and P src ^ 0, 
which plausibly encodes our ignorance of these parameters 
and also includes the a priori possible value of zero for both 
rates (unlike the Jefferys prior, uniform in the logarithm of 
the rates). Hence the following posterior plots also show the 
likelihood - and, therefore, the constraining power of the 
PAO data - directly. Applying these simplifications, Eq. ((2| 
reduces to 

Pr(r src ,R bkg |data) cc G(r src )6(.R bkg )Pr(data|r src , R bkg ),(3) 

where Q(x) is the Heaviside step function. 

A self-consistent statistical treatment requires the use 
of intrinsic source and background rates, although these pa- 
rameters are not particularly intuitive themselves in the ab- 
sence of a physical model for the production of UHECRs. 
From those rates, however, we can calculate the expected 
number of source and background events in any sample, as 
well as the fraction of UHECRs that have come from AGNs, 
-Fagn. The constraints on the expected UHECR numbers are 
simply proportional to those on the relevant rates; -Fagn is 
given by the ratio of the expected number of AGN UHECRs 
to the expected total number. It is crucial that we begin by 
parameterising our problem with the fundamental physical 
quantities, the rates P src and .Rbkg, rather than the Fagn 
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(as done in lAbreu et aljfeoicf ); the latter is defined by the 
physical rates along with the energy range and observing 
footprint of a particular dataset, and hence the posterior 
distribution of -Fagn is derived from the posterior distribu- 
tion of the rates, given in Eq. ([3j . Moreover, in small samples 
in which the total arrival rate of UHECRs has a significant 
Poisson uncertainty, the only way to consistently account for 
the (independent) fluctuations in the source and background 
UHECRs is to parameterise their rates explicitly. 

3.1 The likelihood 

The likelihood is the probability of obtaining the observed 
data under the assumption of a particular model. Here, 
the data take the form of the measured arrival directions, 
{f c }, of the N c UHECRs (along with the value of N c it- 
self). It is also possible to use the measured arrival ener- 
gies of the UH E CRs, a possibility which is investigated in 
iMortlock et alj (|201ll ) but is not explored here. To evalu- 
ate the likelihood, we employ a 'counts in cells' approach, 
dividing the sky into N p = 180 x 360 = 64800 pixels dis- 
tributed uniformly in right ascension and declination. The 
data are hence recast as the set of UHECR counts in each 
pixel, {iVc,p}. In the limit of infinitely small pixels this is 
mathematically equivalent to the likelihood written dir ectly 
in terms of the arrival directions (jMortlock et al.ll201lf ) , but 
is more straightforward to analyse and simulate. 

The full likelihood of the data is a product of the inde- 
pendent Poisson likelihoods in each pixel, and is hence given 
by 



Pr({iV c , p }|r src ,i? 



bkgj 



n 

P =i 



(AWp + A^rc.p) C ' P eX P [-(Ai r bkg , :p + N src ,p) 
Nc,p\ 



(4) 



where Abk g , P and N SICtP are the expected number of back- 
ground and source UHECRs in pixel p, respectively. In the 
limit of small pixels, the denominator in Eq. Q can be ig- 
nored as iV G ,p! = 1 if there is never more than one UHECR 
in a pixel. 

The expected number of background UHECRs in pixel 

p is 



Abkg.p = -Rbkg j ~^Q^ ohs ' 



(5) 



where the integral is over the p'th pixel and de/df2 is the 
PAO exposure per unit solid angle (see Section l2~Tj) . The 
expected number of UHECRs from known sources in pixel 
p is 



N 



SYC.p 

£ 



diV a rr (-Bobs E m i B ,D B ) 

dtdA 



f- 



Pr(f obs |f s ) df/, ob s, (6) 



where the sum is over the AGN sources, 
Pr(fobsl^s) is the smearing probability (Eq. [T]), and 
dA^an- (_E b s ^ -Emm, D) / {At dA) is the rate (i.e. number 
per area and time) of UHECRs from a source at distance 
D arriving at Earth above the cut-off energy, -£/ m in- This 
rate is therefore proportional to the source rate, r src , but 
further depends on both the shape of the AGN CR injection 



spectrum and also the distance-dependence of the GZK 
energy losses, and so requires an explicit UHECR model 
(see Eq. [T0)> . 

The positional dependence of Abk g , P and N STClP are 
both shown in Fig. [T] The right panel is a combination of 
both the PAO exposure and the local distribution of AGNs, 
although comparing the left and the right panel it is clear 
that the latter dominates. In particular, by far the strongest 
source is Centaurus A (with I = 309? 5 and b = 19? 4), which 
has previously been suggested as th e dominant source of 
UHECRs (e.g. lAbraham et al.|[2007bh . 

It is possible within the Bayesian approach to as- 
sess whether any single UHECR came from a particular 
source, and the full fo rmalism for doing so is presented in 
IMortlock etlU] (|201lT ). However, a useful estimate of the 
probability that a UHECR, with measured arrival direction 
in pixel p, has come from one of the sources under consider- 
ation is 



Pr(from source \p, T src , R 



A s 



bkgj 



(7) 



:,p I - A b kg,p 

given values for the two rates. As the rates inferred from 
a sample of even just 27 UHECRs are not sensitive to any 
one event, it is reasonable to evaluate P src using the best-fit 
values of -Rbkg and P src to assess the likely origin of each 
UHECR in turn, and this is done for the PAO data in Sec- 
tion g] 



3.2 UHECR model 

We adopt a simple model for UHECR generation in which 
all AGNs emit UHECRs at the same overall rate and with 
a power-law energy flux of J oc E^ 1 . This implies a differ- 
ential emission rate of the form AN emit /AE oc E^ 1 ' 1 . The 
spectrum is normalised such that the total emission rate of 
UHECRs with energy greater than E is simply 



dA omi t(> E) 

At 



E 

Emu 



(8) 



where -E m i n = 5.7 x 10 19 eV is the minimum UHECR energy 
and r src is the rate at which each source emits UHECRs 
(as above). The UHECR luminosity of each AGN is hence 
Lstc = 7/(7 — 1) r^c-Emm- We take r src to be the same for 
all AGNs, although it is plausible that the UHECR emis- 
sion rate scales with an AGN's hard X-ray luminosity (e.g. 
IProtheroe fc Szabolll992f). We also fix the logarithmic slope 
at 7 = 3.6 (| Abraham et alj|20icl ). Deviati ons from these 
fiduci al models will be explored further in M ortlock et al.l 
l|201ll ). 

The dominant energy loss mechanism of UHECRs is the 
interaction with the CMB photons through the GZK effect. 
Although clearly a stochastic process, its most important 
feature is the exponential decrease in a UHECR's energy 
with distance. This can be captured by adopting a continu- 
ous loss approximation (cf. lAchterberg et alll 19991 1 in which 
a UHECR's arrival energy is given by 

Earr = max [Eqzk, E cmit {l - /gzk) D/Lgzk ] , (9) 

where D is the distance to the source, /gzk = 0.2 is the aver- 
age fractional energy loss per GZK int eraction, and .Lgzk = 
4 Mpc is the GZK mean free path (e.g. iRachen fc Biermannl 
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1 19931 ). It is also assumed that there are no further energy 
losses once a CR reaches Egzk, although this is unimpor- 
tant for UHECRs with Emin > -Egzk (such as those in the 
PAO sample analysed here). 

Combining Eqs Q and ([9| with the distance to the 
AGN then gives the rate per unit area of CRs arriving at 
Earth with energy E arr ^ -Emm from each AGN as 

W-D/L GZK 



D) 



(1 - /gzk) 1 



(10) 



dtdA 4tyD 2 

This can finally be used in Eq. ([6]) to calculate the expected 
number of CRs in each pixel, and therefore the likelihood 
(given in Eq. [4| . 

3.3 Simulations 

It is useful to test the constraining power of a small number 
of UHECRs by generating mock PAO samples with known 
progenitor properties. We created simulations of the two ex- 
treme cases: an AGN-only sample in which all the UHE- 
CRs were sourced from the nearby VCV AGNs and propa- 
gated using the simple GZK model described in Section \3. 21 
and an all-background sample in which the arrival direc- 
tions are random over the whole sky. In both cases the in- 
cident UHECRs were subject to the PAO's measurement 
errors and declination-dependent exposure. Both samples 
were constrained to have exactly 27 events so as to pro- 
vide parameter constraints that can be compared directly 
with those from the real PAO samplfl 

The results of the AGN-only simulation are shown in 
Fig. [2] As expected, the constraints on P src match the 
naive Poisson expectation; more interesting is the rejection 
of the possibility that more than a few of the PAO UHE- 
CRs are not from the VCV AGNs. The constraints on the 
AGN fraction (see Fig. [5]) from such a data-set would be 



Fag? 



1.00 



-11.00 



, where the quoted value is the maximum 



of the posterior, and all limits given in this paper enclose 
the most probable 68 % of the posterior. This strong result 
implies that, if AGNs source all UHECRs, even a sample of 
27 events would be sufficient to confirm this hypothesis if a 
complete catalogue of the progenitors was available. 

The results of the background-only simulation are 
shown in Fig. [3] Again, the constraints on -Rbkg match the 
Poisson expectation. The resultant constraints on the AGN 
fraction would be Eagn = 0.00t[^ (see Fig. [6|. However 
it is also important to note that some pixels (very far away 
from any AGN) have negligible contribution from the VCV 
AGNs, and because some of the UHECRs in this sample fell 
in those pixels, there is a strong upper bound on Eagn that 
is significantly lower than unity. 

The fact that the posteriors from the AGN-only and the 
background-only simulations are almost completely disjoint 
implies that even a sample of just 27 UHECRs might be 
sufficient to provide a definitive answer as to their origin. 
Given the observed distribution of the PAO UHECRs, the 
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Figure 2. The posterior probability of the UHECR rate from 
VCV AGNs, r src , and the uniform background rate, -Rbkg> im- 
plied from a simulated sample of 27 UHECRs, all of which were 
emitted by VCV AGNs. The contours enclose 68%, 95% and 
99.7% of the posterior probability, and the line plots show the 
marginalised probability for each rate. 
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Figure 3. Same as Fig. [2] but for a simulated sample of 27 
isotropically distributed UHECRs. 



parameter constraints from the real data should lie between 
the two extremes shown in Figs. [2] and [3] 



2 It would be inconsistent to draw N c from a Poisson distribution 
with mean of 27, as the observed number of UHECRs is already 
the result of a Poisson draw from the (unknown) mean number 
expected. One of the more convenient aspects of Bayesian pa- 
rameter estimation is that it is possible to obtain error estimates 
without the need for an ensemble of realisations. 



4 RESULTS 

The posterior probability distribution 111 1 sre and i?bkg given 
the PAO UHECR sample is shown in Fig. H The con- 
straints in this figure represent our main result and it is 
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Figure 4. Same as Fig. [2] but for all 27 PAO UHECRs. 
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Figure 5. Same as Fig. [2] but for the 22 PAO UHECRs which 
with arrival directions at least 10 dog from the Galactic plane. 



useful to discuss some of its features in more detail. As ex- 
pected, the posterior is intermediate between the extreme 
cases shown in Figs. [2] and [3] We find the marginalised rates 



7.4+" 



(equivalent to UHECR 
x 10 31 W) and P bkg = 



to be T src = (5.8I2.9) x 10 
source luminosity of L s 
(8.0ii;g) x 10" 17 sr" 1 m" 2 s" 1 . We also calculate the pos- 
terior distribution of the fraction of the PAO UHECRs that 
come from VCV AGNs, shown in Fig. [B] The most proba- 
ble value is Fagn = 0.15, and the constraints can be sum- 
marised by the interval Fagn = 0.15lg'o7 (where these lim- 
its enclose the most likely 68 % of the posterior probability). 

As most extragalactic catalogues are incomplete close 
to the Galactic plane we also repeated the above analysis 



on a reduced data-set from which the region with Galac- 
tic latitudes of |6| ^ 10 deg had been removed (see Fig. [5}. 
The PAO exposure in the retained regions is 7480 yr km 2 sr 
and the number of UHECRs included was reduced from 
27 to 22. The lower numbers resulted in slightly broader 
constraints on Fagn, as can be seen from Fig. [5] From 
this cut data, we find r src 



(7.6 



-2.0\ 
-1.7J 



x 10" 



_(5.6tl; a 8 ) x 10 J 
sr 1 m 2 s 1 and Fag? 



S , -Rbkg 

+0.11 

0.09- 



0.18 



The analysis was also repeated using larger mean smear- 
ing angles of a — 6 deg and a — 10 deg. The limits on 
the AGN fraction in these models are Fagn = 0.22+q'J 2 , 
{a = 6 deg) and Fagn = 0.31±^ 2 (a = 10 deg). In both 
cases the most probable value of Fagn is higher than in 
the fiducial model, although a broader range of Fagn val- 
ues is compatible with the data as well. It is natural that 
a higher AGN fraction be compatible with the data given 
larger values of a, as a greater fraction of the sky is within 
a o f at least on e sourc e, a nd this effect ha s been seen by 
e.g. iKim fc Kiml ll201ll) an d lAbraham et all (|201Cl ) . In par- 
ticular, IKim fc Kiml (|201ll ) report the fraction of observed 
UHECRs that originate from AGNs to be 0.45 for a smear- 
ing angle of 6 deg. However the best-fit value of Fagn in- 
creases less strongly with a in the Bayesian formalism we 
present than found by using other methods; the inherent 
self-consistency of the Bayesian approach ensures the cor- 
rect balance is struck between the compatibility of this more 
forgiving model and the lack of predictivity. 



There is strong evidence of a UHECR signal from the 
known VCV AGNs, which manifests in the result that 
-Fagn = 0.15 is 200 times more probable than Fagn = 0.00, 
but not all the PAO UHECRs can be explained this way. 
These results could also be cast in terms of model compar- 
ison if only the background-only (i.e. Fagn = 0) or the 
AGN-only (i.e. Fagn = 1) possibilities were considered. 
The former cas e , which is the null hypothesis rejected by 
lAbraham et al.l (|2007bf ) . is actually reasonably consistent 
with the data, whereas the hypothesis that all the PAO 
UHECRs come from VCV AGNs is completely ruled out be- 
cause there are several events with no plausible AGN pro- 
genitor in the VCV catalogue. The probability that each 
of the 27 UHECRs came from one of the VCV AGNs was 
calculated explicitly according to Eq. (J7} by adopting the 
best-fit values for i?bk g and r src given above; these proba- 
bilities are given in Table [1] For a = 3 deg, only 9 events 
have P S rc ^0.1, all of which were identified as being within 



3.2 deg of an AGN with z < 0.017 by 
2007b). However the other 11 events which 



Abraham et al 



Abraham ct al 



2007bl ) identified as AGN correlated have very low values 
of Psrc, in most cases because the angular correlation is with 
an AGN that is close to their maximum redshift and so has a 
significantly reduced UHECR flux at Earth. Moreover, 14 of 
the UHECRs have P src < 0.001, with no plausible AGN pro- 
genitor, at least within the VCV catalogue. As also shown in 
Table [TJ the results are similar, but less conclusive, for larger 
smearing angles. The AGN hypothesis cannot be ruled out 
for the low P src events, however: these UHECRs could have 
come from AGNs that are not in the VCV catalogue (and 
some could have come from VCV AGNs if deflected by more 
than a few degrees). 
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Table 1. The measured arrival directions of the 27 PAO UHECRs 
listed in Abraham ct al. (2007b) along with their assessment of 
of AGN correlation (PAO) and our values of the AGN progenitor 
probability (which is rounded to zero if less than 0.0005) for the 
three different smearing angles. The CRs marked with *1, *2 and 
*3 in the b column are those closest to Centaurus A, with angular 
separations of 0.9, 2.3 and 5.8 deg respectively. 
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0.000 


0.000 
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yes 
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yes 


0.000 


0.134 


0.387 


—27.7 


— 17.0 


yes 


0.099 


0.067 


0.033 


—31 4 


13.0 


yes 


0.078 


0.171 


0.424 


—75.6 


—78.6 


yes 


0.380 


0.528 


0.493 


58.8 


—42.4 


yes 


0.000 


0.000 


0.000 


—52.8 


14. I* 3 


yes 


0.870 


0.836 


0.711 


4.2 


-54.9 


yes 


0.000 


0.004 


0.008 


48.8 


-28.7 


yes 


0.000 


0.000 


0.000 


-103.7 


-10.3 


no 


0.000 


0.000 


0.001 


-165.9 


-46.9 


yes 


0.000 


0.003 


0.010 


-27.6 


-16.5 


yes 


0.099 


0.067 


0.033 


-52.3 


7.3 


no 


0.167 


0.533 


0.577 


88.8 


-47.1 


yes 


0.000 


0.000 


0.002 


-170.6 


-45.7 


yes 


0.000 


0.006 


0.011 


-51.2 


17.2* 2 


yes 


0.952 


0.873 


0.735 


-57.2 


41.8 


no 


0.005 


0.123 


0.294 


63.5 


-40.2 


yes 


0.000 


0.000 


0.000 


-51.4 


19.2* 1 


yes 


0.964 


0.881 


0.742 


-109.4 


23.8 


yes 


0.000 


0.000 


0.002 


-163.8 


-54.4 


yes 


0.001 


0.006 


0.020 


-41.7 


5.9 


no 


0.002 


0.208 


0.454 


12.1 


-49.0 


yes 


0.000 


0.001 


0.003 


-21.8 


54.1 


yes 


0.000 


0.005 


0.088 


-65.1 


34.5 


no 


0.000 


0.049 


0.321 


-125.2 


-7.7 


no 


0.001 


0.002 


0.002 



5 CONCLUSIONS 

We have performed a Bayesian analysis to test whether the 
first 27 UHECRs with E ohs ^ 5.7 x 10 19 eV detected by the 
PAO (i.e. those observed before 2007 September) have come 
from the known local AGNs in the VCV catalogue. The first 
main conclusion from this analysis is that at least some do - 
or at least come from progenitors within a few degrees of the 
VCV AGNs. The fraction of UHECRs that come from the 
VCV AGNs is constrained to be 0.15jlo o?- Conversely, our 
second important result is that many of the PAO UHECRs 
have not come from AGNs in the VCV catalogue, either be- 
cause of incompleteness (most obviously close to the Galac- 
tic plane) or because there is another source of UHECRs, 
possibly in our own Galaxy. 

Our results differ somewhat from those presented by 
lAbraham et al.l (|2007bl ) due to our more explicit modelling 
of background and source events as well as the different sta- 
tistical methods used. The starting point of their analysis is 
the null hypothesis that the UHECRs have not come from 
local AGNs; they find that this is rejected 'at the 99% level' 
given the number of the UHECRs that are within 3 deg of 
a VCV AGN. This result was as expected, which illustrates 
the potentially circular reasoning when the obvious simple 
null hypothesis does not match prior knowledge (i.e. the ex- 
pectation that some of the UHECRs did, in fact, come from 
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Figure 6. Posterior distributions of the fraction of observed 
UHECRs that are from the population of VCV AGNs, F AGN , 
shown for simulated samples (both isotropic and AGN-only) and 
for the real PAO data. Curves for both the full sample of 27 UHE- 
CRs and the cut sample of 22 UHECRs (with arrival directions 
at least 10 deg from the Galactic plane) are shown in the same 
panel. Each panel represents a different smearing angle. 



the AGNs), although the strength of the correlation made 
it clear that there was at least some connection between 
the two populations. But it is impossible to go beyond this 
limited statement due to the use of arbitrary cuts in the 
correlation analysis (both in angular radius and AGN red- 
shift), the equal weighting of the nearest known AGN, Cen- 
taurus A, with the hundreds of AGNs at distances of about 
100 Mpc, and the equal value placed on any angular match 
out to about 3 deg, which dilutes whatever correlation sig- 
nal is present (and also increases the chance that a non-AGN 
sourced UHECR is assigned a spurious match). The simu- 
lations of AGN-only and background-only UHECR samples 
shown here demonstrate that even a sample of just 27 events 
is sufficient to decisively distinguish between these two ex- 
treme possibilities, but also that the appar ent strong cor- 
relation inferred by lAbraham et al.l (|2007bl ) is in part due 
to the analysis method. By introducing a model of both 
the AGN-sourced UHECRs and a uniform background, the 
Bayesian analysis can be thought of as giving the optimal 
weight to any potential UHECR-AGN pairing, given our 
prior knowledge of the physics of UHECR propagation and 
the measurement process. 

During the final preparation of this paper, the Pierre 
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Auger Collaboration present ed an extended an alysis of an 
enlarged set of 69 UHECRs (|Abreu et alj|2010h . also com- 
paring the UHECR arrival directions with other extragalac- 
tic catalogues. Aside from the correlation-based methods 
they had used previously, they also included a likelihood- 
based formalism that has some similarities to our method. 
The results of the two approaches are broadly similar 
(whilst differing from the earlier correlation-based analy- 
ses), primarily because they both include a physical model 
of UHECR propagation. They hence go closer to the ideal 
of including all the available information (i.e. not just the 
data but knowledge of the CR physics) and so produce more 
robust results. 

In addition to applying the methods described here 
to this enlarged dataset, there are several extensions to 
our analysis that will allow more rigorous conclusions re- 
garding the origins of these particles. Most importantly, 
we can account for the energy of individual CRs in our 
likelihood, rather than just demanding they are above the 
Emin = 5.7 x 10 19 eV cut. This, in turn, will make it more 
important to use a more realistic, stochastic calculation of 
the GZK effect, and also the energy-dependent CR deflection 
due to magnetic fields. It will similarly be more important to 
investigate the possibility that the AGN UHECR emission 
rate scales with AGN luminosity; a corollary is that it may 
be possible to discriminate between different AGN emission 
models. 

In future work, we will investigate whether UHECR 
data could be used yet more efficiently by including lower- 
energy events. This would obviously increase the numbers, 
although there is the potentially severe penalty of diluting 
the angular signal by including UHECRs that have either 
been deflected by more than about 10 deg or have come 
from the many AGNs at distances of greater than about 
100 Mpc. To the degree that the CR propagation and deflec- 
tion models are accurate, these trade-offs can be evaluated 
objectively, following the underlying principle of extracting 
as muc h information as poss ible from the UHECR measure- 
ments l|Mortlock et alj|201ll ). 

Another way to potentially make better use of UHE- 
CRdata would be to use a more homogeneous AGN sample 
than the VCV catalogue. An obvious example is the cata- 
logue o f AGNs from the Sw ift Burst Alert Telescope (BAT) 
survey jTueller et alj|2008l ). which has nearly uniform selec- 
tion c riteria outside the Galactic plane. Both George et all 
(|2008i ) and the latest PAO analysis from lAbreu et al.1 (|201of ) 
compare UHECR data to this catalogue. In particular, 
I George et al.l d2008T) approach the an alysis in a fashion sim- 
ilar to that of lAbraham et all J2007b | ) an d found correlation 
at the '98% level' . iMortlock et al.l (|201ll ) will apply the fully 
Bayesian methods described in this paper to the BAT AGNs, 
as well as extending the approach in order to provide a more 
rigorous analysis. By combining optimal statistical methods 
with the ever-increasing UHECR data-sets it should soon be 
possible to definitively determine the origins of UHECRs. 



d'Orfeuil and Subir Sarkar. Abraham Achterberg's help was 
also invaluable, in particular by providing an unpublished 
manuscript. Roberto Trotta provided valuable input during 
the early stages of this project. This paper was improved 
thanks to a number of thoughtful comments by the anony- 
mous referee. 
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